Preattentive Reading and Selective Attention for Document Image Analysis
نویسنده
چکیده
PixED (from Pixel to Electronic Document) is aimed at converting document images into structured electronic documents which can be read by a machine for information retrieval. The approach is based on the combination of perception and symbol reading which are the two processes involved when humans detect the organisation of a document. "Pre-attentive reading" denotes the physical segmentation related to perceptual organisation. "Selective attention" means that symbol reading is limited to specific sequences of symbols or to pre-attentively selected locations. An OCR provides the primary structured description of the document. PixED improves the quality of this description, completes the physical segmentation and adds a logical description. A distributed software architecture and an incremental strategy are defined to enable the integration of perception and symbol reading. The approach is tested on a set of documents composed of several pages which are gathered from proceedings of scientific conferences.
منابع مشابه
Applying Preattentive Visual Guidance in Document Image Analysis
In this paper, we present a novel methodology on document image analysis (DIA) which harnesses the mechanism of preattentive visual guidance in human vision. Summarizing the psychophysical research on preattentive vision, we suggest using two types of computations to simulate this biological process: the visual similarity clustering and visual saliency detection. Based on the computational impl...
متن کاملPersian Printed Document Analysis and Page Segmentation
This paper presents, a hybrid method, low-resolution and high-resolution, for Persian page segmentation. In the low-resolution page segmentation, a pyramidal image structure is constructed for multiscale analysis and segments document image to a set of regions. By high-resolution page segmentation, by connected components analysis, each region is segmented to homogeneous regions and identifyi...
متن کاملDocument Analysis And Classification Based On Passing Window
In this paper we present Document analysis and classification system to segment and classify contents of Arabic document images. This system includes preprocessing, document segmentation, feature extraction and document classification. A document image is enhanced in the preprocessing by removing noise, binarization, and detecting and correcting image skew. In document segmentation, an algorith...
متن کاملHarnessing Preattentive Processes for Multivariate Data Visualization
A new method for designing multivariate data visualization tools is presented . These tools allow users to perform simple tasks such as estimation, target detection, and detection of data boundaries rapidly and accurately. Our design technique is based on principles arising from an area of cognitive psychology called preattentive processing. Preattentive processing involves visual features that...
متن کاملSaccadic Object Recognition with an Active
An active vision system for saccadic camera gaze shifts and explorative scene analysis as a new integral approach to image understanding is proposed. The model includes several subsystems: preattentive peripheral feature detection, multi resolution foveal image identiication based on a hypercolumnar representation and object recognition by means of two memories for foveal identiications and xat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999